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Abstract —In (TJ a syndrome counting based upper bound on 
tbe minimum distance of regular binary LDPC codes is given. 
In tbis paper we extend tbe bound to the case of irregular and 
generalized LDPC codes over GF{q). The comparison to the 
lower bound for LDPC codes over GF{q) and to the upper 
bound for non-binary codes is done. The new bound is shown to 
lie under the Gilbert-Varshamov bound at high rates. 

I. Introduction 

In this paper we investigate the minimum code distance 
of LDPC codes El, 0 over Fg. Such codes have good 
error-correcting capabilities, efficient encoding and decoding 
algorithms. All of these makes the codes very popular in 
practical applications. 

In 0 a syndrome counting based upper bound on the 
minimum distance of regular binary LDPC codes is given. 
In this paper we extend the bound to the case of irregular and 
generalized LDPC codes over Fg. 

Our contribution is as follows. First we derive the upper 
bound for generalized LDPC codes (we assume the Tanner 
graph 0 to be a regular one) over Fg. The bound depends 
on the weighQ enumerator of the constituent code. Second 
we derive the upper bound for irregular LDPC codes (we 
assume the Tanner graph to be an irregular one) over Fg. The 
constituent code in this case is a single parity-check (SPC) 
code over Fg. We compare the new upper bound to the lower 
bound for LDPC codes over Fg 0 and to the upper bound 
for non-binary codes 0. At last we show the derived bound 
to lie under the Gilbert-Varshamov bound at high rates. 

11. Generalized LDPC codes 

In this section we obtain the upper bound on the minimum 
distance of generalized LDPC codes. We use Elias-Bassalygo 
type arguments 0. 

Let us briefly consider the construction of generalized 
LDPC code C over Fg. To construct such a code we use 
a bipartite graph, which is called the Tanner graph 0 (see 
Fig. HI)- The graph consists of N variable nodes vi,V 2 , ■ ■ ■ ,vn 
and M check nodes Ci, C 2 ,..., Cm- In this section we assume 
all the check nodes to have the same degree nq (such Tanner 
graphs are called right regular ones). We associate constituent 
codes to each of the check nodes. In this section all the 

'Here and in what follows by weight we mean the Hamming weight, i.e. 
a number of non-zero elements in a vector. 
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Fig. 1. Tanner graph 


constituent codes are the same (we denote the constituent code 
by Co). We assume Co to be a linear [no, Rq, (io]-code over Fg. 
Let us denote the parity-check matrix of the constituent code 
by Ho. The matrix has size mq xnq, where mq = {1 — Rq)nq. 

Let G{s,nq,dq) be the weight enumerator of the code Co, 
i.e. 

no 

G(s, nq, dq) = 1+Y^ A(i)s\ 

i=do 

where A{i) is the number of codewords of weight i in a code 
Co- 

To check if r = (ri, r 2 ,..., rjv) G F^ is a codeword of 
C we associate the symbols of r to the variable nodes {vi = 
Ti, i = 1,..., N). The word r is called a codeword of C if all 
the constituent codes are satisfied (the symbols which come to 
the codes via the edges of the Tanner graph form codewords 
of the constituent codes). 

It is clear the resulting code C is linear, so it has a parity- 
check matrix associated to it. We denote the matrix by H. The 
code is over Fg and has the length N . 

By S we denote the resulting syndrome of a generalized 
LDPC code, i.e, for a received sequence r 

S = Hr^. 

The syndrome consists of the constituent code syndromes and 
can be presented in such a way 

S = (Si,S2,...,Sm), 






where S^, i = 1,..., M, is a syndrome of the i-th constituent 
code. 

Let us introduce a notation. For a discrete random variable 
X, Hq{X) denotes the entropy of X, i.e., 

Hq{X) = - ^ Pr(X = a;) logg Pr(X = x). 

X 

In what follows we will need the fact formulated in the 
Lemma below 

Lemma 1: Let X be the random variable taking t values, 
let p* > 1/t and let 


After applying Lemma [T] we have 

' B{q,u}N,6N) 


HqiS) > -log„ 


a) ( 9 - 1 )-^, 


> N{hq{oj) - Rcwiq,^^,^) + o{l)), (2) 


where Rcw{q, is an upper bound of the rate of constant 
weight code. 

Now we proceed with the right part of ([T]i. Let us consider 
the i-th constituent code, recall, that Si is a random variable 
and it is easy to see that 


Pr(X = Xi) = Pt <p*, Vi = 1,..., f, 


then 

Hq{X) > - loggip*). 


PO = Pr(Si = 0) 

1 

“ CX-j-ir 




,i=0 


N -no 
W-i 


(9-1) 


W-i 


Let us introduce some additional notation. For a real number 
0 < X < 1 let 


We are interesting in asymptotic estimate when N ^ oo. 
In this case we have 


hqix) = -xlogQX- (1 - x) logQ(l - x) +x\ogQ{Q - 1). 

be Q-ary entropy function. 

We are ready to prove a theorem 

Theorem 1: Let C be a generalized LDPC code of length 
TV, rate R, minimum distance 5N, with constituent [no, 
code Co over F,. Let G{s,no, do) be the weight enumerator 
of Co- Then for sufficiently large N the following inequality 
holds 

hg{uj) - Ecw(g,cu, J) 1 

Vo(l-Fo(a;)) 

where 


R{C) < 1 — max 

S/2<uj<l 





and 


Po = 




.2=0 


+ 0(1) 


= (l-a;)”«G 


LJ 




no, do + o(l). 


After applying the log sum inequality for the entropy of the 
random variable we have 


poiu.) = (1 - cu)-G . 

Proof: Consider all possible vectors of length N, weight 
W = loN over Fg. We introduce an equiprobable distribution 
on such vectors. Recall, that S denotes the syndrome and S^, 
i — 1,..., M, denotes the syndrome of the i-th constituent 
code. S and S^, i = 1,..., M, are random variables. 

Note, that 

M 

Hq{S = (Si, S2, . . . , Sm)) < ^ Hq{S,). (1) 

2=1 


q^Q-l 

Hq{S^) = - ^ Pr(S, = Sj) log.j Pr(Sj = Sj) 

3=0 

q^a-l 

= -Po log,Po - = ®j ) ^ ®t) 

i=i 

< -Po log, Po - (1 - Po) log, 

= mohqmo{l - Po). (3) 

Finally after substituting of (|2]l and Q into ([U we obtain 


Our aim now is to estimate left and right parts of the 
inequality ([T]). 

Let us start from the left part of (|T]i. Let us consider 
the probability Pr(S = s) for some fixed syndrome s. It is 
clear that the number of vectors of weight uiN giving the 
syndrome s is upper bounded with the maximal cardinality 
B{q,ujN,SN) of a constant weight code with distance 5N 
over F,, in other words 


E < 1 - 


hq{uj) - Rcw{q,^^,d) 
/i,mo(l -Po(w)) 


o(l). 


(4) 


Now the maximization domain is 0 < w < 1, to finish the 
proof we need to reduce it to 5/2 < w < 1. We just need to 
note, that for w < 5/2 


Pr(S 


s) < 


B{q, ujN, SN) 


N 


Rcw{q,oj,S) = 0 

and maximum (for this sub-interval) is achieved at w = 5/2. 

















III. Irregular LDPC codes 

In this section we derive the upper bound for irregular LDPC 
codes over Fg. We assume the Tanner graph to be irregular. 
The constituent code in this case is a single parity-check (SPC) 
code over F^. 

First we note that an SPC code over F, is an MDS code. 
For the MDS code the number of codewords of weight W can 
be calculated as follows 


A{W) = [s^]Gis,do,no) 



Thus the enumerator of an SPC code over Fg is as follows 

G(s, do = 2, no) = i (1 + (<z - l)s)"« + ^{1 - s)"". 

To formulate a theorem we need a notion of row degree 
polynomial 

max 

p{x) = 

i—T min 

where pi is a fraction of rows of the parity check matrix of 
weight i, Tmin and Tmax are the minimal and maximal row 
weights accordingly. 

Theorem 2: Let C be an LDPC code of length N, rate 
R, minimum distance 5N, with row degree polynomial p{x). 
Then for sufficiently large N the following inequality holds 


R{C)<R[q,p{x)) 


= 1 — max 


hq{uj) - Rcw{q,i^,S) 


S/2<hj<l ^ 


Q-1 

9 




+ 0 ( 1 ). 


Proof: Consider the right part of O, we have 


M 


^ max 

= (1 - XI P^^t 

i—T min 


1-(1 



_y 


^ max 

= (1 — R) ^ ( Pihq 

i='rmln 


< (1 - R)hq 




These completes the proof. ■ 

Remark 1: We note that the bound improves the result from 
Q for the binary case. Recall that in H) in case of irregular 
LDPC code it is suggested to just substitute rmax to the bound 
for regular code. 

At last we prove that the upper bound is better for regular 
codes (with the same average row degree as irregular codes). 

Proposition 1: Let & > 0 be an integer, let p{x) be the row 
degree distribution of irregular code, such that ~ 

b and let preg = x^, then 


R{q,p{x)) < R{q,preg{x)). 


TABLE I 

Comparison of regular and irregular LDPC codes for q = 8, 
R = 0.9 


P15 

P20 

P25 

P30 

P35 

P40 

P45 

0 

0 

0 

1 

0 

0 

0 

0.25 

0 

0 

0.5 

0 

0 

0.25 

0.125 

0.125 

0 

0.5 

0 

0.125 

0.125 

0 

0 

0.5 

0 

0.5 

0 

0 

°Lnpc 

0.0512 

0.0493 

0.0500 

0.0512 


TABLE II 

Results eor high-rate codes, q = 8 


{£, no); R 

Sgv 

°Lnpc 

&BHL 

(3,10); 0.7 

0.1260 

0.2102 

0.2239 

(3,50); 0.94 

0.0179 

0.0263 

0.0355 

(3,100); 0.97 

0.0080 

0.0106 

0.0106 

(3,200); 0.985 

0.0036 

0.0043 

0.0073 

(3,500); 0.994 

0.0013 

0.0013 

0.0026 

(3,600); 0.995 

0.0011 

0.0010 

0.0021 


Proof: Let a > 0. By the concavity of the function 
we have 

p{a) > = Preg(Q!). 

These completes the proof. ■ 

IV. Numerical results 

In this section we present the numerical results. We use an 
upper bound derived in 0 as a function Rcw{q,^, 5). To 
the best knowledge of the author the bound is currently the 
best upper bound on the rate of non-binary constant weight 
codes. The results are shown in Tables mini and mil We use 
the following notation; 

• Sgy ~ ths Gilbert-Varshamov bound; 

• ^LDPC ~ bound for LDPC codes derived in the 

paper; 

• ^^LDPC ~ lower bound for LDPC codes from 0; 

• dsHL - the upper bound on the minimum distance of 
non-binary codes 0, which is an improvement of the 
Aaltonen bound Q. 

We hrst compare the values of the new estimate for 

regular and irregular codes. In Proposition [T] we proved that 
the bound is better for regular codes. Here we present some 
values calculated for q = 8 and R = 0.9. We hx the degree of 
the variable node 1 = 8. The results are shown in Table U We 
note, that for this case dpHL = 0.0638 and dev = 0.0328. 

For now let us compare 8gv^ ^up and 5bhl for the case 
of high-rate LDPC codes over Fg. In Table HJ the results are 
shown. We choose regular (f = 3,no) LDPC codes. We see 
that the new bound improves the best upper bound for non¬ 
binary codes {Sbhl)- We also see that at very high rates 
(i? > 0.994) the new bound lies below the Gilbert-Varshamov 
bound. We note that the interval of rates in which we observe 
this behavior is decreasing when q grows. For q = 2 the 
interval is i? > 0.985, for q = 16 the interval is i? > 0.997. 





































TABLE III 

Comparison to the lower bound, g = 64 


{£, no)-,R 

<5gv 

°Lnpc 

°LDPC 

(14, 16); 0.125 

0.7400 

0.7355 

0.8539 

(9, 12); 0.25 

0.5894 

0.5860 

0.7319 

(15, 24); 0.375 

0.4608 

0.4585 

0.6101 

(14, 28); 0.5 

0.3462 

0.3445 

0.4881 

(15, 40); 0.625 

0.2427 

0.2415 

0.3661 

(13, 52); 0.75 

0.1492 

0.1480 

0.2441 

(8, 64); 0.875 

0.0665 

0.0575 

0.1221 


At last we compare the new upper bound to the lower bound 
on the minimum distance of LDPC codes over F^. We use the 
lower bound from 11. The results for g = 64 are shown in 
Table Uni 
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V. Conclusion 

The new upper bound on the minimum distance of gen¬ 
eralized and iiTegular LDPC codes over F, is derived. For 
the derivation of the bound we used Bassalygo-Elias type 
arguments. The bound is proved to be better for regular LDPC 
codes over Fg. We compared the new upper bound to the 
lower bound for LDPC codes over F, and to the upper bound 
for non-binary codes. We showed, that at very high rates 
(R > 0.994 for q = 8) the new bound lies below the Gilbert- 
Varshamov bound. We note that the interval of rates in which 
we observe this behavior is decreasing when q grows. For 
q = 2 the interval is i? > 0.985, for g = 16 the interval is 
R > 0.997. 
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